Efficient mapping of Applied Biosystems SOLiD sequence data to a reference genome for functional genomic applications

نویسندگان

  • Brian D. Ondov
  • Anjana Varadarajan
  • Karla D. Passalacqua
  • Nicholas H. Bergman
چکیده

UNLABELLED Here, we report the development of SOCS (short oligonucleotide color space), a program designed for efficient and flexible mapping of Applied Biosystems SOLiD sequence data onto a reference genome. SOCS performs its mapping within the context of 'color space', and it maximizes usable data by allowing a user-specified number of mismatches. Sequence census functions facilitate a variety of functional genomics applications, including transcriptome mapping and profiling, as well as ChIP-Seq. AVAILABILITY Executables, source code, and sample data are available at http://socs.biology.gatech.edu/

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast Mapping and Precise Alignment of AB SOLiD Color Reads to Reference DNA

Applied Biosystems’ SOLiD system offers a low-cost alternative to the traditional Sanger method of DNA sequencing. We introduce two main algorithms of mapping SOLiD’s color reads onto a reference genome. The first method performs mapping by adapting a greedy alignment framework. In such an alignment, reads are mapped to approximate genome positions, allowing for a pre-specified bound on sequenc...

متن کامل

Comparison of Sequence Reads Obtained from Three Next-Generation Sequencing Platforms

Next-generation sequencing technologies enable the rapid cost-effective production of sequence data. To evaluate the performance of these sequencing technologies, investigation of the quality of sequence reads obtained from these methods is important. In this study, we analyzed the quality of sequence reads and SNP detection performance using three commercially available next-generation sequenc...

متن کامل

The Pattern of Linkage Disequilibrium in Livestock Genome

Linkage disequilibrium (LD) is bases of genomic selection, genomic marker imputation, marker assisted selection (MAS), quantitative trait loci (QTL) mapping, parentage testing and whole genome association studies. The Particular alleles at closed loci have a tendency to be co-inherited. In linked loci this pattern leads to association between alleles in population which is known as LD. Two metr...

متن کامل

Functional Screening of Phosphatase-Encoding Genes from Bacterial Sources

Phosphatase (APase) enzymes including phytases have broad applications in diagnostic kits, poultryfeeds, biofertilizers and plant nutrition. Because of high levels of sequence diversity among phosphatases,an efficient functional screening method is a crucial requirement for the isolation of the encodinggenes. This study reports a functional cloning screening method for the iso...

متن کامل

Chromatin immunoprecipitation sequencing (ChIP-Seq) on the SOLiDTM system

and negative control libraries are created, the samples are sequenced on the SOLiD System. The short sequence reads from the SOLiD System are mapped against genomic sequences using the SOLiD System alignment tools available through the Applied Biosystems software development community (http://info.appliedbiosystems. com/solidsoftwarecommunity/) or third-party tools compatible with SOLiD sequenc...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Bioinformatics

دوره 24  شماره 

صفحات  -

تاریخ انتشار 2008